IMAGE EXPANDING APPARATUS AND METHOD THEREOF 

Background of the Invention 
Field of the Invention: 

The present invention relates to an image expanding apparatus and a 
method thereof for restoring a compressed code of an image to an original 
image, in particular, to those according to the JPEG (Joint Photographic 
Experts Group) 2000 system. 

Description of the Related Art: 

The JPEG system, which is a conventional image encoding system, 
uses DCT (Discrete Cosine Transform) and Huffman codes of zero run 
lengths and values of DCT coefficients as engineering elements. Due to 
characteristics of the encoding system, the codes cannot be directly changed. 

Thus, when some change is desired to be made for an image, it is necessary 
to expand the image, change the expanded image, and then re-encode the 
changed image. In contrast, the JPEG 2000 system, which is a new 
encoding system, uses a wavelet transform and a tile architecture that allow 
an image size and an image range to be easily changed. 

The specification of the JPEG 2000 system is described in a web site 
http://www.jpeg.org/public/fcdl5444-l.pdf (reference 1). 

In the JPEG 2000 system, an image is divided into at least one tile. 
Each tile is transformed according to the discrete wavelet transform (DWT). 

Wavelet transform coefficients are arithmetically encoded for each bit map. 

In addition, the wavelet transform is performed for an mage at multiple 
stages. As shown in Fig. 1, an image is divided into an LL1 component, an 



LH1 component, an HL1 component, and an HH1 component. The LL1 
component is sub-divided into an LL2 component, an LH2 component, an 
HL2 component, and an HH2 component. Each component is further sub- 
divided a plurality of times. In the example shown in Fig. 1, the LH1 
5 component, the HL1 component, and the HH1 component compose a 
resolution level R3. The LH2 component, the HL2 component, and the 
HH2 component compose a resolution level R2. The resolution, the image 
size, and the number of gradation levels of the code of the JPEG 2000 
system are scalable. 

10 A code stream of the JPEG 2000 system is composed of one main 

header and a plurality of tile streams. Each tile stream is composed of one 
tile header and one packet stream. Each packet stream is composed of a 
plurality of packets. Each tile stream may be composed of a plurality of 
tile parts. In this case, the beginning tile part is composed of one tile 

1 5 header and a plurality of packets. Each of the second and later tile parts is 
composed of one tile part header and a plurality of packets. 

The main header is composed of SOC (start of code-stream), SIZ 
(image and tile size), COD (coding style default), QCD (quantization 
default), COC (coding style component), QCC (quantization component), 

2 0 RGN (region of interest), POC (progression order change), PPM (Packed 

packet header: main header), PLM (packet length: main header), TLM (tile- 
part lengths: main header), CRG (component registration), and COM 
(comment). However, only SOC, SIZ, COD, and QCD are mandatory. The 
others aire optional. The tile header is composed of SOT (start of tile), COD 



(start of code), QCD (quantization default), COC (coding style component), 
QCC (quantization component), RGN (region of interest), POC (progression 
order change), PPT (packed packet headers: tile-part), PLT (packet length: 
tile-part), COM (comment), and SOD (start of data). However, only SOT 
5 and SOD are mandatory. The others are optional. The tile part header is 
composed of SOT (start of tile), POC (progression order change), PPT 
(packed packet headers: tile-part), PLT (packet length), COM (comment), 
and SOD (start of data). However, only SOT and OSD are mandatory. 
The others are optional. 
1 0 The compound image file format of the JPEG 2000 system is 

described in a web site http://www.jpeg.org/public/ fcdl5444-6.pdf (reference 

2) . With a compound document file, individual portions of a page that 
contains text and an image can be compressed in different compressing 
systems. 

15 According to the reference 2, a compound document file (JPM (JPEG 

2000 File Format - Multi-layer) file) has a structure as shown in Fig. 2. 
One Page box (B.2.1) can contain any number of Layout Object boxes 
(B.3.1). One Layout Object box (B.3.1) may contain at least one Object box 
(B.4.1). 

2 0 Japanese Patent Laid-Open Publication Nos. 2002-152517 (reference 

3) and 2002-152744 disclose 'Image expanding apparatus for transformed 
code" and "Image expanding method for transformed code". 

However, according to the inventions disclosed as the foregoing 
references 3 and 4, a generated parameter of an original image cannot be 



obtained. Thus, an image cannot be optimally expanded. In addition, 
these inventions do not deal with a multiple layout object structure of an 
image. A layout object is defined as Object box (B.4.1) in the reference 2. 

Summary of the Invention 

Therefore, an object of the present invention is to provide an image 
expanding apparatus and a method thereof that allow an expanded image of 
a designated range, a designated resolution, and a designated layout object 
to be obtained from a minimum amount of code. 

According to the present invention, an image expanding apparatus is 
provided, that comprises a coding parameter detecting portion for detecting 
coding parameters from an image code; an expanding parameter 
designating portion for designating expanding parameters; an extracting 
parameter calculating portion for calculating an extracting parameter based 
on the coding parameters and the expanding parameters; and a code 
extracting portion for extracting a code necessary for obtaining an expanded 
image designated by the expanding parameters from the image code with 
reference to the extracting parameter. 

The image expanding apparatus may further comprise an image 
expanding portion for obtaining an expanded image designated by the 
expanding parameters based on the extracted code. 

In the image expanding apparatus, the code extracting portion may 
extract as the extracted code a code having a necessary resolution level of a 
necessary tile of a necessary layout object. 

In the image expanding apparatus, the coding parameters include 



vertical and horizontal sizes in pixels of a page image, the number of layout 
objects in a page, and the size of a tile; the expanding parameters include an 
expanded image size proportional to the resolution of the expanded image, 
the number of layout objects to be displayed; the number of a layout object 
5 to be displayed, and an image display area for each layout object; the 

extracting parameter calculating portion is configured to calculate as the 
expanding parameters a layout object necessary for expanding an image, a 
tile necessary for each layout object, and a resolution level necessary for 
expanding an image based on the coding parameters and the expanding 
10 parameters. 

In the image expanding apparatus, the image expanding apparatus 
deals with a compound document file format according to the JPEG 2000 
system. 

These and other objects, features and advantages of the present 
1 5 invention will become more apparent in light of the following detailed 
description of a best mode embodiment thereof, as illustrated in the 
accompanying drawings. 

Brief Description of Drawings 
Fig. 1 is a schematic diagram describing a discrete wavelet transform 
2 0 according to the JPEG 2000 standard; 

Fig. 2 is a schematic diagram showing the structure of a JPM file; 
Fig. 3 is a block diagram showing the structure of an mage expanding 
apparatus according to an embodiment of the present invention; 

Fig. 4 is a schematic diagram showing a plurality of layout objects 



contained in a JPM file; 

Fig. 5 is a schematic diagram showing a concept of a process for an 
image performed by the image expanding apparatus according to the 
embodiment of the present invention; and 

Fig. 6 is a schematic diagram showing a computer that accomplishes 
an image expanding apparatus according to an embodiment of the present 
invention. 

Description of Preferred Embodiments 
Next, with reference to the accompanying drawings, an embodiment 
of the present invention will be described. 

Fig. 3 shows the structure of an image expanding apparatus 
according to an embodiment of the present invention. Referring to Fig. 3, 
the image expanding apparatus according to the present embodiment of the 
present invention has a coding parameter detecting portion 100, an 
expanding parameter designating portion 200, an extracting parameter 
calculating portion 300, a code extracting portion 400, and an image 
expanding portion 500. 

The coding parameter detecting portion 100 detects coding 
parameters P10 from a JMP file. The coding parameters P10 contains 
vertical and horizontal sizes in pixels of a page image (Lheight (the height 
of a layout object) and Lwidth (the width of a layout object) of Layout Object 
Header box (B.3.2)), layout object information (Nlobj (the number of layout 
objects in a page) of Page Header box (B.2.1.1) (reference 2), and tile 
information (XTsiz (the height of a tile) and YTsiz (the width of a tile) of Siz 



of a code stream contained in Media Data box (B.5.3)). 

In an image code (fall code) CIO shown in Fig. 4, (1) the vertical and 
horizontal sizes (in pixels) of an image are vertical Xsiz [pixels] and 
horizontal Ysiz [pixels], (2) the number of layout objects is 3, (3) the tile size 
(in pixels) of an image is vertical XTsiz [pixels] and horizontal YTsiz [pixels]. 

The expanding parameter designating portion 200 designates 
expanding parameters P20 such as an image display size and an image 
display layout object as an expanded result. The expanding parameters 
P20 contains the expanded image size (Xlsiz [pixels] x Ylsiz [pixels]) 
proportional to the resolution of an expanded image, the number of layout 
objects to be displayed [num_layer], the number of a layout object to be 
displayed (num_layers), and an image display area of each layout object (xs 
[pixels] x ys [pixels]). 

When an image represented by thick frames 311, 312, and 313 in a 
frame 301 on the left of Fig. 5 is restored to an expanded image 110 in a 
frame 304, (1) the vertical and horizontal sizes of the expanded image are 
designated as vertical Xlsiz [pixels] and horizontal Ylsiz [pixels], (2) the 
image display area is designated as a position represented by a thick frame, 
and (3) the number of layout objects is designated as 3. In this case, the 
vertical and horizontal sizes of the expanded image are given as follows: 

Xlsiz = 2 x XTsiz, Ylsiz = 2 x YTsiz ... Eq. 1 

The extracting parameter calculating portion 300 calculates an 
extracting parameter P30 using the coding parameters P10 detected by the 
coding parameter detecting portion 100 and the expanding parameters P20 



designated by the expanding parameter designating portion 200. The 
extracting parameter P30 is used to optimally expand an encoded image. 
In other words, the extracting parameter calculating portion 300 calculates 
tiles of an image display area contained in the expanding parameters P20 
for each layout object using the coding parameters P10 and the expanding 
parameters P20. In the example shown in Fig. 5, a layout object LI is 
contained in a display area of tiles (0, 0) to (3, 3). Layout objects L2 and L3 
are contained in display areas of tiles (0, 2) to (1, 3). In addition, the 
extracting parameter calculating portion 300 obtains a resolution level 
necessary for displaying an image with a resolution represented by the 
expanded image size contained in the enlarging parameter P20. To obtain 
the resolution level, the extracting parameter calculating portion 300 
references the main header, the tile header, and the tile part header of the 
code stream. 

In the example shown in Fig. 5, since the resolution level is the 
second highest resolution level, the code necessary for expanding the image 
is the LL1 component. In addition, the extracting parameter calculating 
portion 300 obtains a necessary layout object. 

Thus, the extracting parameter P30 designates a layout object 
necessary for expanding an image, a tile necessary for each layout objects, 
and a resolution level necessary for expanding an image. 

The code extracting portion 400 has a function for extracting an 
image code (extracted code) C20 to be expanded from the image code (full 
code) C10 corresponding to the extracting parameter P30. In other words, 

8 



the code extracting portion 400 detects an offset address with respect to the 
beginning from OFF of Object Header box (B.4.2), a stream length from 
LEN thereof, and a reference box from DR thereof and whereby detects the 
beginning of the code stream. In addition, the code extracting portion 400 
detects a tile number Isot of each SOT (start of tile) (A.4.2) (reference 1) 
contained in Contiguous Codestream box (B.5.4) (reference 2) of a code 
stream contained in Media Data box (B.5.3) (reference 2) of a JPM file, 
determines whether or not the detected tile number Isot accords with a 
necessary tile and a necessary resolution level designated by the extracting 
parameter P30, and extracts a tile that starts with SOT when the 
determined result is affirmed. The code extracting portion 400 performs 
this operation for the entire code stream. 

To extract a necessary minimal amount of code for the resolution 
level, the code extracting portion 400 calculates the resolution level using 
Capture Resolution box (1.7.3.6.1) (reference 1) of Resolution box (1.7.3.6) 
(reference 1) of jp2 Header Box (1.7.3) (document 1). According to the 
present invention, RLCP (Resolution leveHayer-component-position) 
progression order is supposed. 

The image expanding portion 500 generates an expanded image 110 
(Figs. 3 and 5) corresponding to a tile extracted by the code extracting 
portion 400. Since the image expanding portion 500 has the same 
structure as prior art, the detailed description will be omitted. The image 
expanding portion 500 performs an expanding method according to the 
reference 2. In particular, the image expanding portion 500 expands a 



plurality of layout objects according to the reference 2 and performs 
processes for a foreground, a background, and a mask using flags. 

The coding parameter detecting portion 100, the expanding 
parameter designating portion 200, the extracting parameter calculating 
portion 300, the code extracting portion 400, and the image expanding 
portion 500 can be accomplished by a computer that has a CPU 801, 
memory 802, an input/output interface 803, a communication interface 404, 
an external storing device 805, and a bus 805, which connect them, as 
shown in Fig. 6 and that reads and executes a program that causes the 
computer to perform those portions. 

In addition, a method performed by the coding parameter detecting 
portion 100, the expanding parameter designating portion 200, the 
extracting parameter calculating portion 300, the code extracting portion 
400, and the image expanding portion 500 can be accomplished by a 
computer that has a CPU 801, memory 802, an input/output interface 803, a 
communication interface 404, an external storing device 805, and a bus 805, 
which connect them, as shown in Fig. 6 and that reads and executes a 
program that causes the computer to perform those portions. 

As described above, according to the present invention, since an 
optimum code is extracted corresponding to the size of an image to be 
expanded, as effects of the present invention, (1) the amount of code to be 
processed can be decreased, (2) the use of the memory can be decreased, (3) 
the expanding process can be quickly performed, and (4) the reliability can 
be improved. Although the present invention has been shown and described 

10 



with respect to the best mode embodiment thereof, it should be understood 
by those skilled in the art that the foregoing and various other changes, 
omissions, and additions in the form and detail thereof may be made therein 
without departing from the spirit and scope of the present invention. 
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